Live freelance tracking. Raw descriptions turned into structured data. Find your next tech project without the noise.
upwork.com 🟢 2026-05-04
🔹 [Target] Extract and categorize information from diverse websites [Method] Use Claude for complex web scraping and data extraction [UI/UX] Not applicable [Stack] Python with BeautifulSoup, Scrapy, or similar libraries; Natural Language Processing (NLP) for categorization [Security] Ensure data privacy and handle sensitive information securely [Format] JSON output with structured categories
👤 Client: 🇨🇦 Canada Member since 2023-12-19
💰 Price: ****
🚩 Problem: Need to extract and categorize complex information from diverse websites using Claude.
📦 Existing: Not specified.
Specifications:
[Target] Extract company acquisitions, investments, and related details
[Method] Develop a robust scraper with NLP for accurate categorization
[Stack] Python, BeautifulSoup, Scrapy, NLP libraries (e.g., spaCy)
[Security] Secure handling of sensitive data
[Format] JSON output
Workflow:
Analyze sample documents to understand required categories and patterns.
Develop a scraper using Python with appropriate libraries for web scraping and NLP.
Test the scraper on a subset of websites to ensure accuracy in extraction and categorization.
Refine the scraper based on testing results and expand to full dataset.
Extract data from 10,000+ websites and format into JSON.